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MODIFICATION PROCESS 

The present invention relates to a modification process. 

5 In particular, the present invention relates to an in vivo modification process. 

Galactomannans are a heterogenous group of cell wall polysaccharides consisting of 
a 0-1-4 linked mannan backbone with varying numbers of a-1-6 linked galactose side 
chains. 

10 

The galactomannans of most significant industrial use are obtained from the 
endosperms of the legumes guar (Cyamopsis tetragonolobus) and locust bean 
(Cerawnia siliqua). These galactomannans differ in their galactose content, guar 
having a galactose to mannose ratio of approximately 1:1.6, whereas the ratio for 
15 locust bean gum (LBG) is approximately 1:3.4. 

The differences in galactose content have significant effects on the functional 
properties of guar gum and LBG. Both galactomannans form highly viscous solutions 
at low concentrations (1-2%) but LBG has the additional property of being able to 
10 form firm gels with other polysaccharides such as xanthan, carrageenan and agarose. 
LBG is extensively used by the food industry in dairy products (notably ice cream), 
salad dressings, sauces, low calorie products and pet foods. However, the use of 
LBG is restricted by the high price and irregular supply. 

!5 Therefore, large scale production of galactomannans with improved functional 
properties, such as a result of an increased mannose to galactose ratio (such as that 
similar to LBG), is desirable. 

Due to the generic chemical similarity between guar gum and LBG and the much 
0 lower price on guar gum, it has been attempted in vitro to convert guar gum into a 
galactomannan with LBG-like properties and with a chemical composition similar to 
LBG. 
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An example of such an in vitro treatment includes the use of a-galactosidase. In this 
regard, see McCleary et al 1983 and EP-A-0255153. 

By using a-galactosidase purified from guar seeds, guar gum with galactose contents 
5 of 10-34% was obtained (Bulpin et al 1990). Analysis of the gelation behaviour of 
the modified guar gum showed that a preparation with a galactose content of 24% 
formed mixed gels with carrageenan displaying similar Theological properties as LBG. 
In comparison, the galactose content of untreated guar gum was 38% and 23% for 
LBG. 

10 

However, from an industrial point of view in vitro degalactosylation of guar gum is 
associated with a number of problems. 

First, huge amounts of a-galactosidase have to be prepared as about 40% of the 
15 galactose in the guar gum has to be removed. 

Second, during incubation it is very important that no hydrolysis of the mannan 
backbone occurs necessitating the use of highly purified a-galactosidase preparations 
devoid of any trace of mannanase activity. A procedure for heterologous production 
20 of the a-galactosidase from guar seed has been published (Overbeeke et al 1986). 
However, the produced a-galactosidases from the tested species were purified before 
investigating the action on guar gum suggesting that the mannanase problem remains 
to be solved. 

25 Third, the yield of galactomannan is reduced because a 40% reduction in galactose 
content corresponds to approximately 15% less modified guar gum. The released 
galactose may be undesirable in the final product and may have to be removed. 

Fourth, there is a considerable risk for depolymerisation of the galactomannan during 
30 incubation with a-galactosidase. 



1/4/08, EAST Version: 2.1.0.14 



WO 97/20937 



PCT/EP96/05581 



3 

Also, there is a risk for contaminating microorganisms to colonise the reaction 
mixture releasing endo-0-mannanases. 

Finally, water has to be removed from the reaction mixture! In addition to the cost 
5 of this process, it will also result in concentration of the buffer that may be used for 
obtaining optimal reaction conditions. 

These examples demonstrate that the present methods for the modification of guar 
gum are associated with problems, some of which are associated with considerable 
10 costs. 

There is therefore need to have an improved method for the modification of guar 
gum. 

15 In this regard, we now realise that it would be beneficial if the modification of a 
mannose/galactose containing compound (such as guar gum) were to occur in vivo in 
plants, such as guar plants, by use of recombinant DNA techniques. 

Thus, in its broadest sense, the present invention relates to in vivo modification of a 
20 mannose/galactose containing compound - such as guar gum - in an organism (or part 
thereof) capable of synthesising that compound by a method that is not native to that 
organism - such as by a method that makes use of recombinant DNA techniques. The 
modification may occur to in relation to any one or more of the precursors of the 
compound (e.g. mannose and/or galactose) or in relation to the compound itself (i.e. 
25 modification of the mannose and/or galactose units of a compound comprising same). 

In particular, the present invention relates to an in vivo modification process that 
affects, preferably increases, the mannose-to-galactose ratio of either an organism (or 
part thereof) capable of producing a mannose/galactose containing compound or of 
30 a mannose/galactose containing compound thereof. This in vivo modification process 
is not a naturally occurring process. 
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Thus, with the in vivo process of the present invention, it is possible to alter the 
internal in vivo ratio of mannose to galactose within an organism and/or the ratio of 
mannose to galactose of a mannose/galactose compound thereof. 

One of the requirements for the production of in vivo modified guar gum is the 
availability of a method for the introduction of suitable genes into guar. This has 
been accomplished to a limited extent by Jorsboe and Okkefs (1994) who transferred 
a selectable and screenable gene used for the development of the transformation 
method. These authors did not report on transformation with a gene to affect the 
mannose to galactose ratio. This is an important point as, from a biotechnological 
point of view, the major obstacle for the production of in vivo modified guar gum is 
the lack of knowledge of galactomannan biosynthesis. Up until now, no genes or 
gene products which control the biosynthesis of guar gum in vivo have been isolated 
and characterised. However, we have now determined some of the genes or gene 
15 products which control the biosynthesis of guar gum in vivo - thus enabling us to 
modify guar gum in vivo. 

In one preferred aspect, the present invention relates to an in vivo modification 
process that affects, preferably increases, the mannose-to-galactose ratio of either an 
20 organism (or part thereof) capable of producing a mannose/galactose containing 
compound or of a mannose/galactose containing compound thereof, the in vivo 
modification process comprising expressing a nucleotide sequence coding for a gene 
product, which gene product has an effect on: 

25 (a) the mannose-to-galactose ratio of mannose and galactose components of a 
mannose/galactose containing compound; and/or 

(b) the mannose-to-galactose ratio of mannose and galactose precursors for a 
mannose/galactose containing compound; 



30 



and wherein the nucleotide sequence is not a natural nucleotide sequence to the 
organism (or part thereof). 
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In another preferred aspect, the present invention relates to an in vivo modification 
process that affects, preferably increases, the mannose-to-galactose ratio of either an 
organism (or pan thereof) capable of producing a mannose/galactose containing 
compound or of a mannose/galactose containing compound thereof, the in vivo 
5 modification process comprising allowing a gene product that is capable of having an 
effect on: 

(a) the mannose-to-galactose ratio of mannose and galactose components of a 
mannose/galactose containing compound; and/or 

10 

(b) the mannose-to-galactose ratio of mannose and galactose precursors for a 
mannose/galactose containing compound; 

to have an effect on: 

(a) the mannose-to-galactose ratio of mannose and galactose components of a 
mannose/galactose containing compound; and/or 

(b) the mannose-to-galactose ratio of mannose and galactose precursors for a 
20 mannose/galactose containing compound; 

and wherein the gene product has not been expressed by a nucleotide sequence that 
is a natural nucleotide sequence to the organism (or part thereof). 

25 Another broad aspect of the present invention relates to the use of a nucleotide 
sequence to affect in vivo, preferably to increase, the mannose-to-galactose ratio of 
either an organism (or part thereof) capable of producing a mannose/galactose 
containing compound or of a mannose/galactose containing compound thereof, 
wherein the nucleotide sequence encodes a gene product which has an effect on: 

30 

(a) the mannose-to-galactose ratio of mannose and galactose components of a 
mannose/galactose containing compound; and/or 
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(b) the mannose-to-galactose ratio of mannose and galactose precursors for a 
mannose/galactose containing compound; 

and wherein the nucleotide sequence is not a natural nucleotide sequence to the 
5 organism (or part thereof). 

Another broad aspect of the present invention relates to the use of a gene product to 
affect in vivo, preferably to increase, the mannose-to-galactose ratio of either an 
organism (or part thereof) capable of producing a mannose/galactose containing 
10 compound or of a mannose/galactose containing compound thereof, wherein the gene 
product has an effect on: 

(a) the mannose-to-galactose ratio of mannose and galactose components of a 
mannose/galactose containing compound; and/or 

15 

(b) the mannose-to-galactose ratio of mannose and galactose precursors for a 
mannose/galactose containing compound; 

and wherein the gene product is not expressed by a nucleotide sequence that is a 
20 natural nucleotide sequence to the organism (or part thereof). 

The term " mannose/galactose containing compound" means a compound comprising 
at least one mannose group and at least one galactose group. 

25 In each of these preferred aspects, it is preferred that the mannose/galactose 
containing compound is galactomannan. 

In each of these preferred aspects, it is more preferred that the mannose/galactose 
containing compound is guar gum. 

30 
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In each of these preferred aspects, it is more preferred that the organism capable of 
producing a mannose/galactose containing compound is a guar plant and the 
mannose/galactose containing compound thereof is galactomannan. However, other 
galactomannan producing plants are encompassed such as fenugreek and lucerne. 
5 Plants that are considered not to produce appropriate quantities of galactomannan 
belong to the family Solanacea and the species Nicotiana tabacum. 

The term "organism (or pan thereof) capable of producing a mannose/galactose 
containing compound" also includes any suitable organism - in particular a plant - 
capable of producing a mannose/galactose containing compound, such that the internal 
in vivo ratio of mannose to galactose of that organism is altered. The term also 
includes any part of an organism that is capable of producing a mannose/galactose 
containing compound, such that the ratio of mannose to galactose of that part is 
altered. The term also includes a part when within an organism or in a live culture 
medium. Preferably, the part is when within an organism per se. An example of a 
part is seed. 

The term "a natural nucleotide sequence to the organism" means an entire nucleotide 
sequence that is in its natural environment and when operatively linked to an entire 
20 promoter with which it is naturally associated, which promoter is also in its natural 
environment. 

The term "mannose and galactose precursors" includes mannose per se or derivatives 
thereof and/or galactose per se or derivatives thereof as precursors for the 

25 biosynthesis of a mannose/galactose containing compound, preferably galactomannan. 
In addition, the term includes precursors for mannose per se or derivatives thereof 
and/or galactose per se or derivatives thereof which in turn are used as precursors for 
the biosynthesis of a mannose/galactose containing compound, preferably 
galactomannan. Preferably, the term means mannose per se or derivatives thereof 

30 (such as mannose-6-phosphate or GDP-mannose) and/or galactose per se or 
derivatives thereof as precursors for the biosynthesis of galactomannan, preferably 
guar galactomannan. 



10 



15 
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The term "gene product" includes peptides, polypeptides, proteins, enzymes and 
RNA. Preferably, the term means an enzyme. 

Preferably the in vivo mannose-to-galactose ratio of the organism (or part thereof) or 
5 mannose/galactose containing compound thereof is higher than that of the guar plant 
or the galactomannan thereof. 

More preferably the in vivo mannose-to-galactose ratio of the organism (or part 
thereof) or mannose/galactose containing compound thereof is substantially similar 
10 to that of the locust bean or the galactomannan thereof. 

Preferably the organism (or part thereof) or mannose/galactose containing compound 
thereof is a guar plant or the gum thereof. 

15 The present invention also covers a mannose/galactose containing compound when 
prepared by the process of the present invention. This mannose/galactose containing 
compound will be referred to as a mannose/galactose containing compound according 
to the present invention. 

20 In addition, the present invention also covers a foodstuff comprising a 
mannose/galactose containing compound according to the present invention. 

In addition, the present invention also covers a composition - such as a foodstuff - 
comprising a mannose/galactose containing compound according to the present 
25 invention admixed with another polysaccharide. Preferably that other saccharide is 
any one or more of xanthan, carrageenan and agarose. 

In addition, the present invention covers methods for preparing, compositions or 
foodstuffs according to the present invention comprising mixing the 
30 mannose/galactose containing compound according to the present invention with 
another suitable ingredient. 
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The broad aspects of the present invention can be achieved by one or more 
appropriate strategies, wherein each strategy constitutes a preferred embodiment of 
the present invention. 

5 A first strategy relates to the use of one or more gene products, or nucleotide 
sequences coding for same, wherein the gene products are useful in the biosynthesis 
of GDP-mannose. This strategy involves the transformation of one or more of the 
genes encoding enzymes that are required for the biosynthesis of GDP-mannose - 
namely the enzyme phosphomannose isomerase (PMI) and/or the enzyme 
10 phosphomannose mutase and/or the enzyme GDP-mannose pyrophosphorylase. 

In this regard, it is believed that one or more of the gene products that are useful in 
the biosynthesis of GDP-mannose increase the levels of mannose-6-phosphate, which 
in turn increase the mannose-to-galactose ratio of a mannose/galactose containing 
15 compound, such as a galactomannan. 

A preferred aspect of the first strategy relates to at least the use of PMI and/or the 
nucleotide sequence coding for the same. In this regard, it is believed that the PMI 
gene product increases the levels of mannose-6-phosphate, which in turn increases the 
20 mannose-to-galactose ratio of a mannose/galactose containing compound, such as a 
galactomannan. It is further preferred that the PMI is a plant PMI. 

A second strategy relates to the use of a a-galactosidase enzyme and the nucleotide 
sequence coding for the same. With this strategy it is possible to utilise a- 
25 galactosidase. such as that from senna or from coffee beans, to alter in vivo the 
mannose-to-galactose ratio of a mannose/galactose containing compound, such as a 
galactomannan. 

A third strategy relates to the combination of the first strategy with the second 
30 strategy, which strategies can be used in any order or simultaneously. 
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A preferred aspect of the present invention relates to a construct comprising or 
expressing any one or more of the nucleotide sequences of the present invention. 



Another preferred aspect of the present invention relates to a vector comprising or 
5 expressing any one or more of the constructs or nucleotide sequences of the present 
invention. 

Another preferred aspect of the present invention relates to a plasmid comprising or 
expressing any one or more of the vectors, constructs or nucleotide sequences of the 
10 present invention. 

Another preferred aspect of the present invention relates to a transgenic organism (or 
part thereof) comprising or expressing any one or more of the plasmids, vectors, 
constructs or nucleotide sequences of the present invention. 

15 

Other preferred aspects of the present invention include methods of expressing or 
allowing expression or transforming any one or more of the nucleotide sequences, the 
constructs, the plasmids, the vectors, the cells, the tissues, the organs or the 
organisms, as well as the products thereof. 

20 

Further preferred aspects of the present invention include uses of the gene products 
for preparing or treating foodstuffs, including animal feed. 

The present invention also relates to isolating a guar gum prepared by the method of 
25 the present invention. 

The present invention also relates to a guar gum prepared by the method of the 
present invention. 

30 The first strategy of the present invention will now be described in more detail by 
way of reference to further preferred aspects of the present invention. 
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According to a first preferred aspect of this aspect of the present invention there is 
provided an enzyme comprising the amino acid sequence shown in Figure 1, or a 
variant, homologue or fragment thereof. 

5 According to a second preferred aspect of this aspect of the present invention there 
is provided a nucleotide sequence coding for the enzyme of the first aspect or a 
sequence that is complementary thereto. 

According to a third preferred aspect of this aspect of the present invention there is 
10 provided a nucleotide sequence comprising the sequence shown in Figure 1, or a 
variant, homologue or fragment thereof or a sequence that is complementary thereto. 

According to a fourth preferred aspect of this aspect of the present invention there is 
provided a construct comprising or expressing the invention according to any one of 
15 the earlier aspects. 

According to a fifth preferred aspect of this aspect of the present invention there is 
provided a vector comprising or expressing the invention of any one of the earlier 
aspects. 

20 

According to a sixth preferred aspect of this aspect of the present invention there is 
provided a plasmid comprising or expressing the invention of any one of the earlier 
aspects. 

25 According to a seventh preferred aspect of this aspect of the present invention there 
is provided a transgenic organism (or part thereof) comprising or expressing the 
invention according to any one of the earlier aspects. 

Preferably in these preferred aspects of this aspect of the present invention the 
30 nucleotide sequence or the enzyme is that as defined in or is contained within or is 
expressed by the above-mentioned aspects of the present invention. 
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Other preferred aspects of this aspect of the present invention include methods of 
expressing or allowing expression or transforming any one of the nucleotide sequence, 
the construct, the plasmid, the vector, the cell, the tissue, the organ or the organism, 
as well as the products thereof. 

5 

Further preferred aspects of this aspect of the present invention include uses of the 
enzyme for preparing or treating foodstuffs, including animal feed. 

A preferred aspect of this aspect of the present invention thus relates to the enzyme 
10 phosphomannose isomerase ("PMI") and a nucleotide sequence coding for that 
enzyme. In particular, the preferred aspect of the present invention relates to 
recombinant PMI. 

In addition, the preferred aspect of the present invention relates to the use of that 
15 recombinant PMI to alter the mannose-to-galactose ratio of either an organism (or 
part thereof) and/or a mannose/galactose containing compound thereof, especially the 
mannose-to-galactose ratio of galactomannan. 

One of the key advantages of the present invention is that by using the recombinant 
20 PMI it is possible to increase the mannose-to-galactose ratio of an organism (or part 
thereof) and/or a mannose/galactose containing compound thereof, in particular in 
vivo modified guar gum. This advantageous aspect is achieved by the insertion of a 
gene or genes encoding a gene product or products which are involved in the 
biosynthesis of mannose/galactose containing compounds such as mannose-6- 
25 phosphate, which gene is most preferably the nucleotide sequence of the present 
invention. 

Other key advantages are that the recombinant enzyme can be prepared easily and in 
large quantities. Also, the nucleotide sequence can be used to change the in vivo ratio 
30 of mannose to galactose levels when inserted (preferably stably inserted) into the 
genome of an organism (or part thereof). 
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Preferably the nucleotide sequence is a DNA sequence. 

« 

In a highly preferred embodiment the nucleotide sequence is a recombinant DNA 
sequence. 

Preferably the nucleotide sequence is obtainable from deposit NCIMB 40774. 

In a highly preferred embodiment the enzyme is expressed by use of recombinant 
DNA techniques. 

Preferably the enzyme is expressed by a nucleotide sequence obtainable from deposit 
NCIMB 40774. 

Preferably the organism is a plant. 
More preferably the plant is a guar plant. 

Preferably the mannose/galactose containing compound is guar gum. 

The enzyme or nucleotide sequence(s) coding for same may be used in vivo in 
combination with one or more other enzymes or nucleotide sequence(s) coding for 
same, which enzymes or nucleotide sequence(s) coding for same are preferably 
prepared by use of recombinant DNA techniques. The PMI enzyme or nucleotide 
sequence(s) coding for same may also be used in vitro. The PMI enzyme or 
nucleotide sequence(s) coding for same may also be used with one or more other 
enzymes or nucleotide sequence(s) coding for same, which enzymes or nucleotide 
sequence(s) coding for same are preferably prepared by use of recombinant DNA 
techniques. 
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The terms "variant", "homologue" or "fragment" in relation to the enzyme include 
any substitution of, variation of, modification of, replacement of, deletion of or 
addition of one (or more) amino acid from or to the sequence providing the resultant 
amino acid sequence has PMI activity, preferably having at least the same activity of 

5 the enzyme shown in Figure 1 . In particular, the term "homologue" covers homology 
with respect to structure and/or function providing the resultant enzyme has PMI 
activity. With respect to sequence homology, preferably there is at least 75%, more 
preferably at least 85%, more preferably at least 90% homology to the sequence 
shown in Figure I. More preferably there is at least 95%, more preferably at least 

10 98%, homology to the sequence shown in the attached Figure 1. 

The terms "variant", "homologue" or "fragment" in relation to the nucleotide 
sequence coding for the enzyme include any substitution of, variation of, modification 
of, replacement of, deletion of or addition of one (or more) nucleic acid from or to 

15 the sequence providing the resultant nucleotide sequence codes for an enzyme having 
PMI activity, preferably having at least the same activity of the enzyme shown in 
Figure 1. In particular, the term "homologue" covers homology with respect to 
structure and/or function providing the resultant nucleotide sequence codes for an 
enzyme having PMI activity. With respect to sequence homology, preferably there 

20 is at least 75%, more preferably at least 85%, more preferably at least 90% 
homology to the sequence shown in Figure 1. More preferably there is at least 95%, 
more preferably at least 98%, homology to the sequence shown in Figure 1. 

The above terms are synonymous with allelic variations of the sequences. 

25 

The term "complementary" means that the present invention also covers nucleotide 
sequences that can hybridise to the nucleotide sequence of the present invention. 

The second strategy of the present invention will now be described in more detail by 
30 way of reference to further preferred aspects of the present invention. 
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According to a first preferred aspect of this aspect of the present invention there is 
provided an enzyme comprising the amino acid sequence shown in Figure 4, or a 
variant, homologue or fragment thereof. 

5 According to a second preferred aspect of this aspect of the present invention there 
is provided a nucleotide sequence coding for the enzyme of the first aspect or a 
sequence that is complementary thereto. 

According to a third preferred aspect of this aspect of the present invention there is 
10 provided a nucleotide sequence comprising the sequence shown in Figure 4, or a 
variant, homologue or fragment thereof or a sequence that is complementary thereto. 

According to a fourth preferred aspect of this aspect of the present invention there is 
provided a construct comprising or expressing the invention according to any one of 
15 the earlier aspects. 

According to a fifth preferred aspect of this aspect of the present invention there is 
provided a vector comprising or expressing the invention of any one of the earlier 
aspects. 

20 

According to a sixth preferred aspect of this aspect of the present invention there is 
provided a plasmid comprising or expressing the invention of any one of the earlier 
aspects. 

25 According to a seventh preferred aspect of this aspect of the present invention there 
is provided a transgenic organism (or pan thereof) comprising or expressing the 
invention according to any one of the earlier aspects. 

Preferably in these preferred aspects of this aspect of the present invention the 
30 nucleotide sequence or the enzyme is that as defined in or is contained within or is 
expressed by the above-mentioned aspects of the present invention. 
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Other preferred aspects of this aspect of the present invention include methods of 
expressing or allowing expression or transforming any one of the nucleotide sequence, 
the construct, the plasmid, the vector, the cell, the tissue, the organ or the organism, 
as well as the products thereof. 

5 

Further preferred aspects of this aspect of the present invention include uses of the 
enzyme for preparing or treating foodstuffs, including animal feed. 

A preferred aspect of this aspect of the present invention thus relates to the enzyme 
10 a-galactosidase and a nucleotide sequence coding for that enzyme. 

In particular, the preferred aspect of the present invention relates to recombinant a- 
galactosidase. 

15 In addition, the preferred aspect of the present invention relates to the use of that 
recombinant a-galactosidase to alter the mannose-to-galactose ratio of either an 
organism (or part thereof) and/or a mannose/galactose containing compound thereof, 
especially the mannose-to-galactose ratio of galactomannan. 

20 One of the key advantages of the present invention is that by using the recombinant 
a-galactosidase it is possible to increase the mannose-to-galactose ratio of an 
organism (or part thereof) and/or a mannose/galactose containing compound thereof, 
in particular in vivo modified guar gum. 

25 Other key advantages are that the recombinant enzyme can be prepared easily and in 
large quantities. Also, the nucleotide sequence can be used to change the in vivo ratio 
of mannose to galactose levels when inserted (preferably stably inserted) into the 
genome of an organism (or part thereof). 

30 Preferably the nucleotide sequence is a DNA sequence. In a highly preferred 
embodiment the nucleotide sequence is a recombinant DNA sequence. 



1/4/08, EAST Version: 2.1.0.14 



WO 97/20937 PCT/EP 96/05581 

17 

Preferably the nucleotide sequence is obtainable from deposit NCIMB 40831. 



In a highly preferred embodiment the enzyme is expressed by use of recombinant 
DNA techniques. 

5 

Preferably the enzyme is expressed by a nucleotide sequence obtainable from deposit 
NCIMB 40831. 

Preferably the organism is a plant. 

10 

More preferably the plant is a guar plant. 

Preferably the mannose/galactose containing compound is guar gum. 

15 The enzyme or nucleotide sequence(s) coding for same may be used in vivo in 
combination with one or more other enzymes or nucleotide sequence(s) coding for 
same, which enzymes or nucleotide sequence(s) coding for same are preferably 
prepared by use of recombinant DNA techniques. The a-galactosidase enzyme or 
nucleotide sequence(s) coding for same may also be used in vitro. The a- 

20 galactosidase enzyme or nucleotide sequence(s) coding for same may also be used 
with one or more other enzymes or nucleotide sequence(s) coding for same, which 
enzymes or nucleotide sequence(s) coding for same are preferably prepared by use 
of recombinant DNA techniques. 

25 The terms "variant", "homologue" or "fragment" in relation to the enzyme include 
any substitution of, variation of, modification of, replacement of, deletion of or 
addition of one (or more) amino acid from or to the sequence providing the resultant 
amino acid sequence has a-galactosidase activity, preferably having at least the same 
activity of the enzyme shown in Figure 4. In particular, the term "homologue" 

30 covers homology with respect to structure and/or function providing the resultant 
enzyme has a-galactosidase activity. With respect to sequence homology, preferably 
there is at least 75%, more preferably at least 85%, more preferably at least 90% 
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homology to the sequence shown in Figure 4. More preferably there is at least 95% , 
more preferably at least 98%, homology to the sequence shown in the attached Figure 
4. 

5 The terms "variant", "homologue" or "fragment" in relation to the nucleotide 
sequence coding for the enzyme include any substitution of t variation of, modification 
of, replacement of, deletion of or addition of one (or more) nucleic acid from or to 
the sequence providing the resultant nucleotide sequence codes for an enzyme having 
a-galactosidase activity, preferably having at least the same activity of the enzyme 

10 shown in Figure 4. In particular, the term "homologue" covers homology with 
respect to structure and/or function providing the resultant nucleotide sequence codes 
for an enzyme having a-galactosidase activity. With respect to sequence homology, 
preferably there is at least 75%, more preferably at least 85%, more preferably at 
least 90% homology to the sequence shown in Figure 4. More preferably there is at 

15 least 95%, more preferably at least 98%, homology to the sequence shown in Figure 
4. 

The above terms are synonymous with allelic variations of the sequences. 

20 The term "complementary" means that the present invention also covers nucleotide 
sequences that can hybridise to the nucleotide sequence of the present invention. 

The term "nucleotide" in relation to the present invention includes genomic DNA, 
cDNA, synthetic DNA, and RNA. Preferably it means DNA, more preferably cDNA 
25 for the coding sequence of the present invention. 

The term "construct" - which is synonymous with terms such as "conjugate", 
"cassette" and "hybrid" - includes the nucleotide sequence directly or indirectly 
attached or fused to a promoter. An example of an indirect attachment is the 
30 provision of a suitable spacer group such as an intron sequence, such as the Shi- 
intron or the ADH intron, intermediate the promoter and the nucleotide sequence. 
In each case, it is highly preferred that the terms do not cover the natural combination 
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of the wild type gene coding for the enzyme ordinarily associated with the wild type 
gene promoter and when they are both in their natural environment. One highly 
preferred embodiment of the present invention therefore relates to the nucleotide 
sequence of the present invention operatively linked to a heterologous promoter. 

5 

The construct may even contain or express a marker which allows for the selection 
of the genetic construct in. for example, a plant, such as guar, into which it has been 
transferred. Various markers exist which may be used, such as for example those 
encoding mannose-6-phosphate isomerase (especially for plants) or those markers that 
10 provide for herbicide or antibiotic resistance - e.g. resistance to G418, hygromycin, 
bleomycin, kanamycin and gentamycin. 

The term "vector" includes expression vectors and transformation vectors. 

15 The term "expression vector" means a construct capable of in vivo or in vitro 
expression. 

The term "transformation vector" means a construct capable of being transferred from 
one species to another - such as from an E.coli plasmid to an Agrobacterium to a 
20 plant. 

The term "tissue" includes tissue per se and organ. 

The term "organism" in relation to the present invention includes any organism that 
25 could comprise the nucleotide sequence coding for the enzyme according to the 
present invention and/or products obtained therefrom, and/or wherein the nucleotide 
sequence according to the present invention can be expressed when present in the 
organism. 

30 Preferably the organism is a guar plant. 
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The term "transgenic organism" in relation to the present invention includes any 
organism that comprises the nucleotide sequence coding for the enzyme according to 
the present invention and/or products obtained therefrom, and/or wherein the 
nucleotide sequence according to the present invention can be expressed within the 
5 organism. Preferably the nucleotide sequence is incorporated in the genome of the 
organism. 

Preferably the transgenic organism is a plant, more preferably a guar plant. 

10 The transgenic organism of the present invention includes an organism comprising 
any one or more of the nucleotide sequences coding for the enzymes according to the 
present invention, constructs according to the present invention, vectors according to 
the present invention, plasmids according to the present invention, cells according to 
the present invention, tissues according to the present invention, or the products 

15 thereof, including combinations thereof. For example the transgenic organism can 
also comprise any one or more of the nucleotide sequences coding for the enzymes 
of the present invention under the control of one or more heterologous promoters. 

In a highly preferred embodiment, the transgenic organism (or part thereof) does not 
20 comprise the combination of a promoter and the nucleotide sequence coding for the 
enzyme according to the present invention, wherein both the promoter and the 
nucleotide sequence are native to that organism (or part thereof) and are in their 
natural environment. Thus, in this highly preferred embodiment, the present 
invention does not cover the native nucleotide coding sequence according to the 
25 present invention in its natural environment when it is under the control of its native 
promoter which is also in its natural environment. In addition, in this highly 
preferred embodiment, the present invention does not cover the native enzyme 
according to the present invention when it is in its natural environment and when it 
has been expressed by its native nucleotide coding sequence which is also in its 
30 natural environment and when that nucleotide sequence is under the control of its 
native promoter which is also in its natural environment. 
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The term "promoter 1 ' is used in the normal sense of the art t e.g. an RNA polymerase 
binding site in the Jacob-Mond theory of gene expression. 



The promoter could additionally include one or more features to ensure or to increase 
5 expression in a suitable host. For example, the features can be conserved regions 
such as a Pribnow Box or a TATA box. The promoters may even contain other 
sequences to affect (such as to maintain, enhance, decrease) the levels of expression 
of the nucleotide sequence of the present invention. For example, suitable other 
sequences include the SA/-intron or an ADH intron. Other sequences include 
10 inducible elements - such as temperature, chemical, light or stress inducible elements. 

Also, suitable elements to enhance transcription or translation may be present. An 
example of the latter element is the TMV 5' signal sequence (see Sleat Gene 217 
[1987] 217-225; and Dawson Plant Mol. Biol. 23 [1993] 97). 

15 

Thus, in one aspect, the nucleotide sequence according to the present invention is 
under the control of a promoter that allows expression of the nucleotide sequence. 
In this aspect, the promoter may be a cell or tissue specific promoter. If, for 
example, the organism is a plant then the promoter can be one that affects expression 
20 of the nucleotide sequence in any one or more of seed, stem, sprout, root and leaf 
tissues. 

By way of example, the promoter for the nucleotide sequence of the present invention 
can be the a-Amy 1 promoter (otherwise known as the Amy 1 promoter, the Amy 
25 637 promoter or the ex-Amy 637 promoter) as described in PCT/EP95/02195. 

Alternatively, the promoter for the nucleotide sequence of the present invention can 
be the a-Amy 3 promoter (otherwise known as the Amy 3 promoter, the Amy 351 
promoter or the a-Amy 351 promoter) as described in PCT/EP95/02196. With the 
30 Amy 351 promoter it is possible to inactivate a part of it so that the partially 
inactivated promoter expresses the nucleotide sequence in a more specific manner 
such as in just one specific tissue type or organ. The term "inactivated" means partial 
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inactivation in the sense that the expression pattern of the promoter is modified but 
wherein the partially inactivated promoter still functions as a promoter. However, 
as mentioned above, the modified promoter is capable of expressing the nucleotide 
sequence in at least one (but not all) specific tissue of the original promoter. 
5 • Examples of other partial inactivation of a promoter sequence (and not just necessarily 
that of the Amy 351 promoter) include altering the folding pattern of the promoter 
sequence, or binding species to parts of the nucleotide sequence, so that a part of the 
nucleotide sequence is not recognised by, for example, RNA polymerase. Another, 
and preferable, way of partially inactivating the Amy 351 promoter is to truncate it 
10 to form fragments thereof. Another way would be to mutate at least a part of the 
sequence so that the RNA polymerase can not bind to that pan or another pan. 

Another modification is to mutate the binding sites for regulatory proteins for 
example the CreA protein known from filamentous fungi to exert carbon catabolite 
15 repression, and thus abolish the catabolite repression of the native promoter. 

General teachings of recombinant DNA techniques may be found in Sambrook,J., 
Fritsch, E.F., Maniatis T. (Editors) Molecular Cloning. A laboratory manual. Second 
edition. Cold Spring Harbour Laboratory Press. New York 1989. 

20 

Even though the enzyme and the nucleotide sequence of the present invention are not 
disclosed in EP-B-0470145 and CA-A-2006454, those two documents do provide 
some useful background commentary on the types of techniques that may be 
employed to prepare transgenic plants according to the present invention. An 
25 adaption of some of these background teachings is now included in the following 
commentary. 

The basic principle in the construction of genetically modified plants is to insert 
genetic information in the plant genome so as to obtain a stable maintenance of the 
30 inserted genetic material. 
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Several techniques exist for inserting the genetic information, the two main principles 
being direct introduction of the genetic information and introduction of the genetic 
information by use of a vector system. A review of the general techniques may be 
found in articles by Potrykus (Annu Rev Plant Physiol Plant Mol Biol [1991] 42:205- 
5 225) and Christou (Agro-Food-Industry Hi-Tech March/ April 1994 17-27). 

Thus, in one aspect, the present invention relates to a vector system which carries the 
nucleotide sequence or construct according to the present invention and which is 
capable of introducing the nucleotide sequence or construct into the genome of an 
10 organism, such as a plant. 

The vector system may comprise one vector, but it can comprise two vectors. In the 
case of two vectors, the vector system is normally referred to as a binary vector 
system. Binary vector systems are described in ftirther detail in Gynheung An et al. 
15 (1980), Binary Vectors, Plant Molecular Biology Manual A3,. 1-19. 

One extensively employed system for transformation of plant cells with a given 
promoter or nucleotide sequence or construct is based on the use of a Ti plasmid from 
Agrobacterium tumefaciens or a Ri plasmid from Agrobacterium rhizogenes An et al. 
20 (1986), Plant Physiol. 81, 301-305 and Butcher D.N. et al. (1980), Tissue Culture 
Methods for Plant Pathologists, eds.: D.S. Ingrams and J. P. Helgeson, 203-208. 

Several different Ti and Ri plasmids have been constructed which are suitable for the 
construction of the plant or plant cell constructs described above. A non-limiting 
25 example of such a Ti plasmid is pGV3850. 

The nucleotide sequence or construct of the present invention should preferably be 
inserted into the Ti-plasmid between the terminal sequences of the T-DN A or adjacent 
a T-DNA sequence so as to avoid disruption of the sequences immediately 
30 surrounding the T-DNA borders, as at least one of these regions appear to be 
essential for insertion of modified T-DNA into the plant genome. 
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As will be understood from the above explanation, if the organism is a plant, then the 
vector system of the present invention is preferably one which contains the sequences 
necessary to infect the plant (e.g. the vir region) and at least one border part of a T- 
DNA sequence, the border pan being located on the same vector as the genetic 
5 construct. 

Furthermore, the vector system is preferably an Agrobacterium tumefaciens Ti- 
plasmid or an Agrobacterium rhizogenes Ri-plasmid or a derivative thereof, as these 
plasmids are well-known and widely employed in the construction of transgenic 
10 plants, many vector systems exist which are based on these plasmids or derivatives 
thereof. 

In the construction of a transgenic plant the nucleotide sequence or construct of the 
present invention may be first constructed in a microorganism in which the vector can 

15 replicate and which is easy to manipulate before insertion into the plant. An example 
of a useful microorganism is E. coli, but other microorganisms having the above 
properties may be used. When a vector of a vector system as defined above has been 
constructed in E. coli, it is transferred, if necessary, into a suitable Agrobacterium 
strain, e.g. Agrobacterium tumefaciens. The Ti-plasmid harbouring the nucleotide 

20 sequence or construct of the invention is thus preferably transferred into a suitable 
Agrobacterium strain, e.g. A. tumefaciens, so as to obtain an Agrobacterium cell 
harbouring the nucleotide sequence or construct of the invention, which DNA is 
subsequently transferred into the plant cell to be modified. 

25 As reported in CA-A-2006454, a large amount of cloning vectors are available which 
contain a replication system in E. coli and a marker which allows a selection of the 
transformed cells. The vectors contain for example pBR 322, pUC series, M13 mp 
series, pACYC 184 etc. 

30 In this way, the nucleotide or construct of the present invention can be introduced into 
a suitable restriction position in the vector. The contained plasmid is used for the 
transformation in £. coli. The E. coli cells are cultivated in a suitable nutrient medium 
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and then harvested and lysed. The plasmid is then recovered. As a method of 
analysis there is generally used sequence analysis, restriction analysis, electrophoresis 
and further biochemical-molecular biological methods. After each manipulation, the 
used DNA sequence can be restricted and connected with the next DNA sequence. 
5 Each sequence can be cloned in the same or different plasmid. 



After each introduction method of the construct or nucleotide sequence according to 
the present invention in the plants the presence and/or insertion of further DNA 
sequences may be necessary. If, for example, for the transformation the Ti- or Ri- 

10 plasmid of the plant cells is used, at least the right boundary and often however the 
right and the left boundary of the Ti- and Ri-plasmid T-DNA, as flanking areas of the 
introduced genes, can be connected. The use of T-DNA for the transformation of 
plant cells has been intensively studied and is described in EP-A-120516; Hoekema, 
in: The Binary Plant Vector System Offset-drukkerij Kanters B.B., Alblasserdam, 

15 1985, Chapter V; Fraley, et al.. Crit. Rev. Plant Sci., 4: 1-46; and An et al., EMBO 
J. (1985) 4:277-284. 

Direct infection of plant tissues by Agrobacterium is a simple technique which has 
been widely employed and which is described in Butcher D.N. et al. (1980), Tissue 

20 Culture Methods for Plant Pathologists, eds.: D.S.. Ingrams and J. P. Helgeson, 203- 
208. For further teachings on this topic see Potrykus (Annu Rev Plant Physiol Plant 
Mol Biol [1991] 42:205-225) and Christou (Agro-Food-Industry Hi-Tech March/ April 
1994 17-27). With this technique, infection of a plant may be done on a certain part 
or tissue of the plant, i.e. on a pan of a leaf, a root, a stem or another part of the 

25 plant. 



Typically, with direct infection of plant tissues by Agrobacterium carrying the 
nucleotide sequence of the present invention, a plant to be infected is wounded, e.g. 
by cutting the plant with a razor or puncturing the plant with a needle or rubbing the 
30 plant with an abrasive. The wound is then inoculated with the Agrobacterium. The 
inoculated plant or plant part is then grown on a suitable culture medium and allowed 
to develop into mature plants. 
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When plant cells are constructed, these cells may be grown and maintained in 
accordance with well-known tissue culturing methods such as by culturing the cells 
in a suitable culture medium supplied with the necessary growth factors such as amino 
acids, plant hormones, vitamins, etc. 

5 

Regeneration of the transformed cells into genetically modified plants may be 
accomplished using known methods for the regeneration of plants from cell or tissue 
cultures, for example by selecting transformed shoots using an antibiotic and by 
subculturing the shoots on a medium containing the appropriate nutrients, plant 
10 hormones, etc. 

Further teachings on plant transformation may be found in EP-A-0449375 . 

Even further useful teachings on the transformation of guar plants can be found in 
15 Danish patent application No. 940662 (filed 10 June 1994). 

Thus, the present invention relates to the use of a gene product (e.g. the PM1 enzyme 
which is involved in the biosynthesis of mannose) to increase the mannose-to- 
galactose ratio of an organism (or part thereof) or mannose/galactose containing 
20 compound thereof. In addition, the present invention relates to that nucleotide 
sequence and the gene product it encodes. 

The present invention is based on the surprising finding that it is possible to increase 
the mannose-to-galactose ratio of guar gum by the insertion of a gene or genes 
25 encoding a gene product or products which are involved in the biosynthesis of 
mannose/galactose containing compounds such as mannose-6-phosphate, namely PMI. 

The findings of the present invention are in contrast to what would have been 
expected in view of the teachings of the art. In this regard, Edwards from analysis 
30 of unpurified preparations of developing endosperms from fenugreek and guar 
suggested that GDP-mannose may be a precursor for the biosynthesis of 
galactomannan (Edwards et al 1989, Reid and Edwards 1995). Through analysis of 
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various amounts of GDP-mannose in the reaction mixture the authors concluded that 
the control of the mannose-to-galactose ratio might reside at the level of specificity 
of the galactomannan-synthesizing glycosyltransferases themselves. This might 
suggest that the glycosyltransferases might be the critical target for genetic 
5 manipulation for the in vivo modification of guar gum. 

Thus, in summation, this preferred aspect of the present invention relates to the 
insertion of a phosphomannose isomerase gene into a plant, preferably guar. 

10 The rationale behind this strategy is based on our understanding that the mannose 
incorporated into the guar galactomannan is derived from GDP-mannose. However, 
the way in which the GDP-mannose is synthesized in guar is not known. The 
classical way is by isomerization of GDP-glucose but some preliminary data suggest 
that in some legumes at least the GDP-mannose might be synthesized by isomerization 

15 of fructose-6-phosphate to mannose-6-phosphate which is isomerized to mannose- 1- 
phosphate which in turn is converted to GDP-mannose. But even if the latter pathway 
should be operative an increase in the PMI activity would not per se be expected to 
affect the galactomannan composition because firstly the fructose-6-phosphate to 
mannose-6-phosphate isomerization is a fully reversible reaction and secondly Reid 

20 and Edwards (1995) have predicted that the mannose-to-galactose ratio is determined 
by the specificity of the galactomannan synthesizing enzymes. With this strategy, and 
as will become apparent, we have obtained an increase of the mannose-to-galactose 
ratio - even when using a weakly expressed promoter in guar. But, even higher ratio 
levels could be achieved by using a stronger promoter. 

25 

The present invention is also based on the surprising finding that it is possible to 
increase the mannose-to-galactose ratio of guar gum by the insertion of a gene or 
genes encoding a gene product or products which are involved in the biosynthesis of 
guar or precursors therefor. As just mentioned, in one preferred aspect of the present 
30 invention the gene codes for PMI. In an alternative preferred aspect the gene codes 
for ot-galactosidase, preferably coffee bean a-galactosidase or senna a-galactosidase, 
more preferably senna a-galactosidase. 
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The following sample comprising the PMI gene of the present invention was 
deposited in accordance with the Budapest Treaty at the recognised depositary The 
National Collections of Industrial and Marine Bacteria Limited (NCIMB) at 23 St. 
Machar Drive, Aberdeen, Scotland, United Kingdom, AB2 1RY on 9 November 
5 1995: 

E.coli K12 containing plasmid pPMI-60. 

The deposit number is NCIMB 40774. 

10 

The present invention also covers nucleotide sequences obtainable from that deposit 
and the products encoded thereby. 

The following sample comprising the a-galactosidase gene of the present invention 
15 was deposited in accordance with the Budapest Treaty at the recognised depositary 
The National Collections of Industrial and Marine Bacteria Limited (NCIMB) at 23 
St. Machar Drive, Aberdeen, Scotland, United Kingdom, AB2 1RY on 28 November 
1996: 

20 E.coli K12 containing plasmid pT7-SEcDNA5. 
The deposit number is NCIMB 40831. 

The present invention also covers nucleotide sequences obtainable from that deposit 
25 and the products encoded thereby. 

The present invention will now be described by way of example. In the following 
Examples reference is made to the accompanying figures in which: 

30 Figure 1 shows the amino acid sequence of one enzyme according to the present 
invention and the sequence of one nucleotide sequence according to the present 
invention; 
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Figure 2 is a plasmid map of pcDNAII; 

Figure 3 is a plasmid map of pSG-Man5; 

5 Figure 4 shows the amino acid sequence of another enzyme according to the present 
invention and the sequence of another nucleotide sequence according to the present 
invention; 

Figure 5 is a plasmid map of pPS48; 

10 

Figure 6 is a plasmid map of pPS48SEGAL; 
Figure 7 is a plasmid map of pBKL4; 
15 Figure 8 is a plasmid map of pBKL4SEGAL; 

Figure 9 is a plasmid map of pPS48-GALIII; and 
Figure 10 is a plasmid map of pBKL4GALIII. 

20 

IN VIVO MODIFICATION OF GUAR GUM USING PMI GENES 
Cloning of a Phosphomannose Isomerase (PMI) Gene from Guar 

25 A cDNA expression library represent mRNA from immature guar endosperm was 
constructed in the plasmid pcDNAII (Invitrogen Corporation) and transformed into 
the E coli strain ToplOF- (Invitrogen Corporation). The quality of the cDNA 
library was controlled by purification of plasmids from a number of separate colonies 
picked at random. Restriction enzyme digestion of the plasmids showed that all 

30 contained an insert of at least 500 bp. 
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The E coli strain CD1 man- contains an inactive PMI gene and is thus unable to 
metabolise mannose (Darzins et al 1985). This strain was used for the following 
complementation studies. 

5 CD1 man- cells were made competent by the method of Hanahan (1985). A titer of 
3-4 x 10 A transformed cells//*g library plasmid cDNA was obtained. A similar titer 
was found when the cells were transformed with a Bluescript control plasmid. 

Prior to the complementation studies, a number of control experiments were 
10 performed in which the transformed cells were plated onto selective medium 
containing M9-salts (Maniatis etal 1982) added 0.05 g/1 leucine, 0.05 g/1 methionine, 
0.05 g/l threonine, 1.0 g/1 thiamin-HCl, 50 mg/1 ampicillin, 6.0 g/1 mannose and 9.0 
g/1 agarose. This medium is hereafter called M9-SGP. 

15 In one experiment CD1 man- competent cells were transformed with the E coli PMI 
gene under control of its native promoter (Mills and Guest 1984) and in another 
experiment the cells were transformed with the E coli PMI gene under the control of 
the plant promoter CaMV 35S (pSGMANl, see Bojsen et al 1993). In both 
experiments, plating of the transformed cells on M9-SGP resulted in large numbers 

20 of large colonies. When competent CD1 man- cells were not transformed or 
transformed with a Bluescript control plasmid, no large colonies were obtained but 
a number of very small hardly visible colonies were observed. Thus the M9-SGP 
selective medium is suitable for selecting cells which contain an active PMI gene. 

25 Competent CD1 man- cells were transformed with plasmid DNA isolated from the 
guar endosperm cDNA library. Transformed CD1 man- cells were plated onto the 
selective substrate M9-SGP. After incubation for two days at 37 °C the majority of 
the plated cells appeared as very small colonies whereas less than 0.1% of the 
colonies were remarkably larger. 

30 

Plasmids from the larger colonies were purified and retransformed into competent 
CD1 man- cells. Twenty different retransformed colonies were assayed for PMI 
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activity in crude extracts. PMI activity was measured by the coupled enzymatic assay 
described by Gill et al (1986). One of the examined clones contained very high PMI 
activity. This clone is designated PMI-60 and is the subject of the deposit number 
NCIMB 40774. 

In order to test whether the insert in PMI-60 originated from guar, genomic guar 
DNA was purified from leaves according to Dellapona et al (1983) restriction 
enzyme digested. Southern blotted and probed with plasmid DNA derived from PMI- 
60 labelled with P-32 according to Feinberg and Vogelstein (1983). Hybridisation 
and washing were performed at 68°C, 6 x SSC and 68°C, 0.2 x SSC (Maniatis et al 
1982), respectively. The PMI-60 probe hybridised to a number of fragments of the 
digested guar genomic DNA. 

The insert in the plasmid of PMI-60 (called pPMI-60) was sequenced by dideoxy 
sequencing applying primer walking (Sanger and Coulson 1977), first by fluorescein 
labelled primers (reverse and universal) and subsequently by internal labelling using 
fluorescein-dATP. 



The size of the cDNA clone was 1.66 kb of which the PMI gene covered 1.29 kb. 
20 The PMI start and stop codon was located at 0.09 kb and 1 .38 kb, respectively. The 
insert contained a putative polyadenylation signal at 0.11 kb downstream from the 
stop codon and is poly-A terminated. 



15 



25 



The identity of the insert was further characterised by comparison to other known 
PMI sequences. At 137-145 amino acids downstream from translation start a 
conserved region is found: DGNHKPEM which is considered to be involved in the 
active PMI site of phosphomannose isomerases (Coulin et al 1993). 

Thus, the presence of the insert in PMI-60 results in rapid growth on selective 
30 mannose containing medium, high PMI activity and hybridisation to guar DNA. 
Furthermore, there is homology to some sequences from other PMI genes. These 
data demonstrate that the insert in PMI-60 is a PMI gene from guar. 
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This PMI sequence is the first PMI sequence ever cloned and sequenced from a plant. 

Transformation using phosphomannose isomerase (PMft gene 

5 The following transformation studies show that it is possible to increase the mannose- 
to-galactose ratio of guar gum by the insertion of a phosphomannose isomerase (PMI) 
- such as that from E.coli or guar. The recombinant PMI catalyses the conversion 
of fructose-6-phosphate to mannose-6-phosphate or mannose-6-phosphate to fructose- 
6-phosphate depending on the substrate available. 

10 

Transformation of Guar 

Transgenic guar plants were obtained by Agrobacterium tumefaciens mediated gene 
transfer as described by JArsboe and Okkels (1994) - the contents of which are 
15 incorporated herein by reference. The preferred Agrobacterium tumefaciens strain 
was in these studies LB A 4404. 

The insert in the T-DNA in the plasmid called pD018 contained 3 genes (right border 
to left border): a ^-glucuronidase (GUS) gene, a phosphomannose isomerase (PMI) 
20 gene and a neomycin phosphotransferase (NPTII) gene. The expression of each of 
the genes was driven by a 35S promoter described in detail on Bojsen et al (1993). 

Analysis of Guar Gum by HPLC 

25 - Pure guar endosperms devoid of embryo and seed coat were prepared by hand and 
treated repeatedly with 70% ethanol during homogenization according to Edwards et 
al (1992). The ethanol precipitate was added 2 ml 2 N trifluor acetic acid (TFA) and 
hydrolysis was performed at 120°C for 2 hours. The TFA was removed by 
evaporation at 50°C. The dried precipitate was dissolved in 500 y\ HPLC grade H 2 0 

30 and 25 /xl sample was applied onto an Aminex HPX-87P column (300 x 7.8 mm). 
The column was heated to 80° C in a column oven and eluted with H 2 0. Elution of 
saccharides was followed with an Rl-detector. Using this system mannose and 
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galactose was baseline separated. The respective peak areas were determined and the 
mannose-to-galactose ratios were calculated. Guar gum and LBG from Sigma were 
used as standards along with endosperm samples from non-transgenic guar plants. 
The mannose-to-galactose ratios obtained were 1.6:1 for both Sigma guar gum and 
non-transgenic endosperms, and 3.5: 1 for Sigma LBG. These ratios are in excellent 
agreement with those generally accepted (Reid and Edwards 1995). 

Mannose-to-Galactose Ratio in PMI-Transformed Guar 



Several independent guar transformants harbouring the PMI gene were analysed with 
respect to mannose-to-galactose ratio in the endosperm galactomannan, see the table 
below. 



Origin of Sample 


Mannose-to-Galactose Ratio 


Guar gum from Sigma 


1.60 


LBG from Sigma 


3.44 


Guar transformant 123-1 


1.64 


Guar transformant 123-2 


1.63 


Guar transformant 124-1 


1.74 



Each of the transformants increased the mannose-to-galactose ratio of guar gum. 

Similar studies are done with the insertion of a phosphomannose isomerase (PMI) 
gene from guar. In this regard, it is believed that the guar PMI catalyses the con- 
version of fnictose-6-phosphate to mannose-6-phosphate or mannose-6-phosphate to 
fructose-6-phosphate depending on the substrate available. 
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IN VIVO MODIFICATION OF CTJAR HIIM USING n-Cl A f A CTOSTD ASF, 
GENES 



In the following examples it is shown that the insertion of an a-galactosidase g< 
5 into guar can result in the in \ivo modification of guar gum. 

Cloning and sequencing of senna a-galactosidase cDNA. 

cDNA clones of a-galactosidase from senna endosperm were isolated by PCR 
10 follows. 



Total RNA was purified from senna endosperm according to the method of 
(Logemann etal. Anal. Biochem. 163 (1987) 16-20). Reverse transcription followed 
by PCR was done with the RT-PCR kit from Perkin Elmer and according to their 
15 protocol. Briefly, approximately 1 mg of total RNA and I mg oligo-dT was 
incubated at 42 °C for 45 min with reverse transcriptase followed by incubation at 
99°C for 5 min and 5°C for 5 min. 

For the following PCR, two oligo nucleotides derived from the guar a-galactosidase 
20 cDNA sequence (Overbeeke et al. Plant Mol. Biol 13 (1989) 541-550) were used: 



25 



PI (5 '-CAACGGGGCTTGCTGCTTTAGG) 



and 



P2 (5 ' -GCCTATGTC A-G ACC AGGATGC), 



at positions 415-437 and 1248-1270, respectively in the guar a-galactosidase cDNA 
sequence. 

30 

The PCR conditions were: 1 min at 94°C, 2 min at 55°C, 2 min at 72°C for 35 
cycles followed by 10 min at 72°C. 
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The PCR product was analysed by agarose gel electrophoresis, and a single product 
of 850 bp was obtained. 

The DNA product, designated SEGAL, was cloned into the pT7Blue vector 
5 (Novagen) and the nucleotide sequence was determined using a Termo sequenase 
fluorescent cycle sequencing kit (Amersham) and an ALF DNA sequencer 
(Pharmacia). 

The 3* and 5' ends of the senna a-galactosidase cDNA were obtained by the method 
10 called 3' and 5' RACE as previously described (Nielsen et al. Plant Mol. Biol 31 
(1996) 539-552. Briefly, for 3' RACE, approximately lmg of the above total RNA 
and 2.5 pmol of the primer: 

Q T (5' CCAGTGAGCAGAGTGACGAGGACTCGAGCTCAAGC(T) 17 ) 

15 

was incubated at 42°C for 45 min with reverse transcriptase followed by incubation 
at 99°C for 5 min and 5°C for 5 min. 

The cDNA was amplified by two rounds of PCR. 

20 

For the first PCR the downstream primer was: 
Qo (5 '-CC AGTGAGC AGAGTGACG) , 
25 and the upstream primer was: 

3 *GSP1 (5 '-GTCCTCTGAGTGATAACAGAGTGG) , 

a gene specific primer derived from the 850 bp nucleotide sequence of the fragment 
30 SEGAL, position 1096-1118 in the senna a-galactosidase cDNA (Figure 4). 

In the second PCR the downstream primer was: 
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Qi (5 *-G AGG ACTCGAGCTC A AGC) 



PCT/EP96/05581 



and the upstream primer was: 

5 3'GSP2 (5 ' -GGTGTTGTGGAATAG AAGTTC ATC) , 

a gene specific primer derived from the 850 bp nucleotide sequence of the fragment 
SEGAL, position 1123-1146 in the senna a-galactosidase cDNA (Figure 4). 

10 The PCR conditions for the first PCR were: 1 min at 94°C, 2 min at 60°C, 2 min 
at 72°C for 35 cycles followed by 10 min at 72°C. For the second PCR the 
conditions were: 1 min at 94°C, 2 min at 50°C, 2 min at 72°C for 35 cycles 
followed by 10 min at 72 °C. 

15 The PCR product after the second PCR was analysed by agarose gel electrophoresis, 
and a single product of 530 bp was obtained. 

The DNA product, designated 3' SEGAL, was cloned into the pT7Blue vector 
(Novagen) and the nucleotide sequence was determined using a Termo sequenase 
20 fluorescent cycle sequencing kit (Amersham) and an ALF DNA sequencer 
(Pharmacia). 

For 5' RACE the 5 'RACE system from Gibco BRL was employed with 3 gene 
specific primers constructed from the 850 bp nucleotide sequence of the fragment 
25 SEGAL. Briefly, approximately 1 mg of the same total RNA as used above and 2.5 
pmol of the gene specific primer: 

5'GSPl (5 '-TTGCACCTTGGTCTTCATGTCC), 

30 position 561-582 in the senna a-galactosidase cDNA (Figure 4) was incubated for 
70°C for 10 min followed by addition of reverse transcriptase and incubating at 42 °C 
for 30 min, 70°C for 15 min and the addition of RNaseH and incubating further 10 
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min at 55 °C. 

The cDNA was dC-tailed according to the protocol of Gibco BRL. The tailed cDNA 
was subjected to two rounds of PCR. 

5 

In the first PCR an upstream ANKER primer (5'-GGCCACGCGTC- 
GACTAGTACGGGGGGGGGG) was used with the gene specific downstream primer: 

5'GSP2 (5-C ATAGCTTTACTGCATGTTTGGTTTCC) , 

10 

position 510-536 in the senna a-galactosidase cDNA (Figure 4) 
In the second PCR an upstream UNI primer: 
15 (5 ' -GGCC ACGCGTCG ACT AGTACG) 

was used with the gene specific downstream primer: 
5'GSP3 (5 ' -C AGCC AG AGCCTTAATTCCTG A AGG) , 

20 

position 438-462 in the senna a-galactosidase cDNA (Figure 4). 

The PCR conditions for the first PCR were: 1 min at 94°C, 2 min at 51 °C, 2 min 
at 72 °C for 10 cycles followed by 1 min at 94°C, 2 min at 59°C, 2 min at 72 °C for 
25 25 cycles followed by 10 min at 72 °C. 

For the second PCR the conditions were: 1 min at 94°C, 2 min at 59°C, 2 min at 
72°C for 35 cycles followed by 10 min at 72°C. 

30 The PCR product after the second PCR was analysed by agarose gel electrophoresis, 
and a single product of 480 bp was obtained. 
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The DNA product, designated 5 'SEGAL, was cloned into the pT7Blue vector 
(Novagen) and the nucleotide sequence was determined using a Termo sequenase 
fluorescent cycle sequencing kit (Amersham) and an ALF DNA sequencer 
(Pharmacia). 

5 

The complete nucleotide sequence of the senna a-galactosidase cDNA, combined 
from the 3 PCR clones, 5'SEGAL, SEGAL and 3'SEGAL, obtained above, consisting 
of 1630 bp is shown in Figure 4. Analysis of the nucleotide sequence reveals (Figure 
4) an open reading frame encoding 406 amino acid residues with the first methionine 
10 at nucleotide position 93 and a translation stop codon at nucleotide position 1311. 
The deduced amino acid sequence is shown above the nucleotide sequence in Fig 4. 

Construction of a plant transformation vector containing the senna a- 
galactosidase. 

15 

An expression vector comprising the coding sequence for senna a-galactosidase was 
constructed as follows. 

The vector pPS48 (Figure 5) was constructed by inserting the 0.75 kb cauliflower 
20 mosaic virus (CaMV) 35S RNA promoter (E35S) containing a duplication of the -90 
to -420 region (Kay et al. Science 236 (1987) 1299-1302), the 0.21 kb fragment 
containing the CaMV 35S RNA polyadenylation sequence (Odell et al. Nature 313 
(1985) 810-812) and a synthetic oligonucleotide linker (PstVBamHl-Smal-SacVSati- 
Sphl) into pUC8 (Vieira and Messing, Gene 19 (1982) 259-268). 

25 

The three DNA fragments, 5' SEGAL, SEGAL and 3'SEGAL, were linked together 
by PCR to reconstitute a clone of senna a-galactosidase cDNA, containing the coding 
sequence and most of the non translated 5' end. 

30 

The fragment 5' SEGAL was re-amplified using the primers 5'GSP3 and B255, 
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(5'- ATTGGATCCACTCACAC-GTATACACTACAC) 

containing a BamHl site plus the nucleotides position 14-36 in the senna a- 
galactosidase cDNA (Figure 4). 

5 

The fragment 3 'SEGAL was re-amplified using the primers 3'GSP2 and B254, 

(5 '-TTAGC ATGCCCTTGGG ATTGTATT-TCCTC) , 

10 containing a Sphl site and nucleotides position 1353-1373 in the senna a-galactosidase 
. cDNA (Figure 4). 

These PCR fragment were mixed with the fragment SEGAL, and the linked sequence 
was amplified by using the flanking primers B254 and B255. 

15 

The PCR conditions were: For 5 'SEGAL, 1 min at 94°C, 2 min at 57°C, 2 min at 
72°C for 35 cycles followed by 10 min at 72 a C. For 3'SEGAL, 1 min at 94°C, 2 
min at 48°C, 2 min at 72°C for 35 cycles followed by 10 min at 72°C. For the last 
PCR to link the three fragments, 1 min at 94°C, 2 min at 58°C, 2 min at 72°C for 
20 35 cycles followed by 10 min at 72°C. 

The PCR product of the last PCR was analysed by agarose gel electrophoresis, and 
a single product of 1380 bp was obtained. 

25 The DN A product was cloned into the pT7Blue vector (Novagen) and the nucleotide 
sequence was determined using a Termo sequenase fluorescent cycle sequencing kit 
(Amersham) and an ALF DNA sequencer (Pharmacia). 

The DNA fragment was isolated by digestion with BamHI and Sphl and cloned into 
30 the expression vector pPS48, resulting in pPS48SEGAL (Figure 6). 
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A plant transformation vector comprising the expression cassette for senna a- 
galactosidase was constructed as follows. 

The vector pBKL4 (Figure 7) was constructed from pBI121 (Clontec Laboratories) 
5 by deletion of the NPTII cassette and by insertion of a synthetic oligonucleotide linker 
(EcoRl-Clal-Sal[-Hin<Rl\-Spe\-Kpn-BamHi) and a new NPTII cassette containing the 
NPTII gene flanked by a CaMV 35S RNA promoter (Odell et al. Nature 313 (1985) 
810-812) and a polyadenylation sequence from the octopine synthase gene (Caplan et 
al. Science 222 (1983) 815-821) between the GUS cassette and the left border. 

10 

The senna a-galactosidase expression cassette was excised from pPS48SEGAL and 
inserted into pBKIA 

The resulting plasmid pBKL4SEGAL (Figure 8) was transformed into Agrobacterium 
15 tumefaciens strain LBA4404 (Hockema et al. Nature 303 (1983) 179-180) for plant 
transformation. 

Transformation of guar with senna a-galactosidase 

20 The a-galactosidase gene from senna was transformed into guar using Agrobacterium 
tumefaciens transformation, as described in detail by Jdrsboe and Okkels (1994) - the 
contents of which are incorporated herein by reference. 

Analysis of the mannose-to-galactose ratio of endosperms from guar plants 
25 transformed with the senna a-galactosidase gene. 

Pure endosperms of guar plants transformed with the senna a-galactosidase gene were 
analysed by HPLC after hydrolysis in 2 N trifluor acetic acid (see above for details). 

30 The results, which are shown in the following Table, are derived from the analysis 
of endosperms from 4 independent guar transformants harbouring the senna a- 
galactosidase gene. 
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The results are given as % increases of the mannose-to-galactose ratios relative to un- 
transformed control endosperms. 



Transformant 


Relative Increase of the Mannose- 
to-Galactose Ratio 


26-5-12-1 


6.5% 


29-5-08-1 


5.3% 


39-5-45-1 


18.9% 


40-5-02-1 


10.6% 



10 Construction of a plant transformation vector containing the coffee bean o- 
galactosidase. 

An expression vector comprising the coding sequence for coffee bean a-galactosidase 
was constructed as follows. 

15 

The vector pPS48 (Figure 5) was constructed by inserting the 0.75 kb cauliflower 
mosaic virus (CaMV) 35S RNA promoter (E35S) containing a duplication of the -90 
to -420 region (Kay et aL Science 236 (1987) 1299-1302), the 0.21 kb fragment 
containing the CaMV 35S RNA poiyadenylation sequence (Odell et al. Nature 313 
20 (1985) 810-812) and a synthetic oligonucleotide linker (Pstl-Bamm-Smal-SacUSall- 
Sphl) into pUC8 (Vieira and Messing, Gene 19 (1982) 259-268). 

A DNA fragment containing the coffee bean a-galactosidase coding sequence was 
isolated from the plasmid pCR-BZ (Zhu and Goldstein Gene 140 (1994) 227-231) by 
25 polymerase chain reaction (PCR) employing an upstream primer: 

5 1 -TTGG ATCC ACCC AAAA-GCTGGTGCTCC 

(position 15-35 in the coffee bean a-galactosidase cDNA sequence) and a downstream 
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primer: 

5 ' -TTAGC ATGCCTGTTAATC ACTGTGGG 

5 (position 1229-12446 in the coffee bean <*-galactosidase cDNA sequence) resulting 
in a 1,2 kb DNA fragment containing a BamHl site in the 5' end of the a- 
galactosidase.gene and a Sphl site in the 3' end. 

The DNA fragment was cloned into the pT7Blue vector (Novagen) and the nucleotide 
10 sequence was determined using a Termo sequenase fluorescent cycle sequencing kit 
(Amersham) and an ALF DNA sequencer (Pharmacia). 

The DNA fragment was isolated by digestion with BamHl and Sphl and cloned into 
the vector pPS48, resulting in pPS48-GALIII (Figure 9). 

15 

A plant transformation vector comprising the expression cassette for coffee bean a- 
galactosidase was constructed as follows. 

The vector pBKL4 (Figure 7) was constructed from pBI121 (Clontec Laboratories) 
20 by deletion of the NPTII cassette and by insertion of a synthetic oligonucleotide linker 
(EcoRl-Clal-Sall-HindllLSpel-Kpn-BamHl) and a new NPTII cassette containing the 
NPTII gene flanked by a CaMV 35S RNA promoter (Odell et al. Nature 313 (1985) 
810-812) and a polyadenylation sequence from the octopine synthase gene (Caplan et 
al. Science 222 (1983) 815-821) between the GUS cassette and the left border. 

25 

The coffee bean a-galactosidase expression cassette was excised from pPS48-GALIII 
by digestion with Xbal and inserted into Spel digested pBKL4. 

The resulting plasmid pBKL4-GALIII (Figure 10) was transformed into 
30 Agrobacterium tumefaciens strain LBA4404 (Hockema et al. Nature 303 (1983) 179- 
180) for plant transformation. 
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Transformation of guar with coffee bean a-galactosidase 

The a-galactosidase gene from coffee bean was transformed into guar using 
Agrobacterium tumefaciens transformation, as described in detail by Jdrsboe and 
Okkels (1994). 

Analysis of the mannose-to-galactose ratio of endosperms from guar plants 
transformed with the coffee bean a-gaJactosidase gene. 



10 



15 



Pure endosperms of guar plants transformed with the coffee bean a-galactosidase gene 
were analysed by HPLC after hydrolysis in 2 N trifluor acetic acid (see above for 
details). The results, which are shown in the following Table, are derived from the 
analysis of endosperms from 4 independent guar transformants harbouring the coffee 
bean a-galactosidase gene. The results are given as % increases of the mannose-to- 
galactose ratios relative to un-transformed control endosperms. 



. 

Transformant 


Relative Increase of the Mannose- 
to-Galactose Ratio 


4-2-4-2 


4.2% 


4-2-5-5 


4.0% 


4-2-8-2 


6.0% 


1 17-3-6-1 


4.1% 



20 



Discussion of in vivo modification of guar gum 



25 



In the two examples presented above, it is shown that the insertion of an a- 
galactosidase gene into guar can result in the in vivo modification of guar gum as 
evidenced by the fact that the mannose-to-galactose ratios of the transgenic galac- 
tomannans were higher than the non-transgenic control galactomannans. 
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In addition to the example with an phosphomannose isomerase gene (see above), these 
are the first data ever presented on the in vivo modification of a plant cell wall 
storage polysaccharide. 

5 SUMMATION 

The results show that it is possible to form transgenic plants that are capable of 
*: forming guar gum such that the mannose-to-galactose ratio is affected. 

10 In this regard, the PMI transformed lines, the coffee bean a-galactosidase transformed 
lines, and the senna a-galactosidase transformed lines yielded a higher ratio of 
mannose to galactose than the non-transformed lines. For example, the coffee bean 
a-galactosidase transformants had a ratio of mannose to galactose of up to 1.75 - 
compared to 1.65 for the non-transformed lines - and that some of the senna a- 

15 galactosidase transformants had a ratio of mannose to galactose of even up to 2. 
These results are highly surprising. 

As will be apparent to those skilled in the art, the extent to which the in vivo 
modification occurs depend on the activity of the galactomannan related enzymes 
20 encoded by the genes transformed into guar. Thus, substitution or modification of 
■ the promoters or other regulatory nucleotide or amino acid sequences may lead to in 
vivo modifications of galactomannans different from those described in the above 
examples. 

25 Other modifications of the present invention will be apparent to those skilled in the 
- art. 
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As mentioned above. Figure 1 presents both a nucleotide sequence coding for a PMI 
enzyme and the amino acid sequence of that PMI enzyme. For the avoidance of 
doubt, that nucleotide sequence can be referred to as SEQ ID No. 1 and that amino 
acid sequence can be referred to as SEQ ID No. 2. 

As mentioned above. Figure 4 presents both a nucleotide sequence coding for an a- 
galactosidase enzyme and the amino acid sequence of that a-galactosidase enzyme. 
For the avoidance of doubt, that nucleotide sequence can be referred to as SEQ ID 
No. 3 and that amino acid sequence can be referred to as SEQ ID No. 4. 
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INDICATIONS R.ELATINC TO A DEPOSITED MICROORGANISM 

(PCT Rule \3bis) 



A. Tnc indication made below, relate to the microorganism referred to in ibe descripti 



0. IDENTIFICATION OF DEPOSn 



Further deposits are ideniif cd on an additional sbcsi Q 



Name ot depositary instirution 

The National Elections of Industrial and Marine Baccaria Liciicad (HCIMB) 



Addre&S of deposiury institution (including postal code and country) 

23 Sc. rtachar Drive 
Aberdeen 
Scotland 
AB2 1RY 

United Kingdom 

Date of deposit 



Accession Number 



C. ADDITIONAL INDICATIONS (UnUott if* tfp r iu j, k) This infonn,«ion i, eon.mued on an .ddi.ional ,aeci Q 



In respect of chose designations in which a European patent is sought, and any 
other designated state having equivalent legislation, a sample of the deoosiced 
microorganism will be made available until the publication of the mention of the 
grant of the European patent or until the date on which the application has be-n 
rerused or withdrawn or is deemed to be withdrawn, only bv the issue of such a 
sample to an expert nominated by the person requesting the sample. (Rule 28(4) 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE ^Micu^.rc^forctt^^S.ata) 



E. SEPARATE FURNISHING OF INDICATIONS (leave U»«i if net applicable) 



Ir«ir°7o U S'r ed be ' 0WW1 " * 1Ubn, " ,ed 10 (ne lnle ™»« i0Ml Bur «« (tpccwAct^iMrcofih* Mitamt c t .. 'Atunim, 



5<1 Tbi * rteei was received with the international application 


Au^tbcHtufofGccr ^2^: 






^Mrs. H. Fransz 



■ For IntemationarBureau use only 
I~l ^« sheet was received by the International Bureau on: 



Authorised officer 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule llbis) 



A. Tnc indication* made below relate 10 (he microgrpmsm referred r 0 i n loe description 



D. IDENTIFICATION OF DEPOSIT Fuah c 7fl™Z7~ . r S= 
„ t uafte. Qe?B:u an ic-uf— on an addmonal ibis: H] 



Name of depoiiiary iruunjuon 



The National Collections of Industrial and Harine Bacteria Limited O 



Address of depositary insiinition (including postal code end country) 

23 Sc. Machar Drive 
Aberdeen 
Scotland 
AB2 1RY 

Uniced Kingdoa 

Daceofdepos.t ~ ' I Acc«;on Numoer 



(MCIMB) 



/-n r ~^«jn nurnoer 



C. ADDITIONAL LIGATIONS Una. „, W Tnie infection j, on ,» ZZ Z Q 



In respect of those designations in which a European oace-c ^ * n .,*h- a 
other designated state having equivalent legisla 5 ln.1 sa^ole oTthVdeoos^L 

TrZVTrl^ 11 ^ ™ de 3Vailable Until Che P«bli^txo« o cne ae^on o ne 
grant of the European patent or until the date on which the aDDliJr,™ LI V 
refused or withdrawn or is deemed to be withdrawn, onTy g the'issue'of suchT 
sample to an expert nomxnated by the person requesting the sample. (Rule 28(4) 



D. DESIGNATED STATES FOR WHICH IND IC ATIONS ARE MADE^ ,„V M ^ .„ HOtfor . lliaKpmti w , 



E. SEPARATE FURNISHING OF INDICATIONS (lave t,h«i if m tpoSenk) 

^°c!r dc '' owwi ' iM " o "'" rt ' o '" c> "' e ""' w 



• For receiving Office uae only 

|3 T*<* *beei wax received with the international application 




For International Bureau me only 



Q Thia sheet vaa received by the International Bureau < 



Authorized office: 
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^ RECEIPT IN THE CASE OF AN ORIGINAL DEPOSIT 

LfcamafcK. issued pursuant CO Rule 7.1 by the 

, INTERNATIONAL DEPOSITARY AUTHORITY 
Identified at the bottom of this page 



I. IDENTIFICATION OF THE HICROORCANISM 



Identification reference given by the 
DEPOSITOR i 

Escherichia coll KU2 
CD-FMI-GO (GDI oBrTccntalnug 



Accession number given by the 
INTERNATIONAL DEPOSITARY AUTHORITY i 



NCIMB 40774 



II. SCIENTIFIC DESCRIPTION ANO/OR PROPOSED TAXONOMIC DESIGNATION 



The nicroorganism identif ied under I above vas accompanied by i 
I j a scientific description 
rT""*| a proposed tasonoaic designation 
(Mark with a cross where applicable) 



III. RECEIPT AND ACCEPTANCE 



This International Depositary Authority accepts the aicroorgenisa identified under I above, 
which was received by it on 9 ^rM^r t^r «*«te of the original deposit) 1 

1995 



IV. RECEIPT OF REQUEST fOR CONVERSION 



The microorganism identified under I above was received by this Ineernatioaal 

Depositary Authority on fdate of the original deposit) and 

a request to convert the original deposit to a deposit under the Budapest Treaty 

was received by it on <date of receipt of request for conversion) 



V. INTERNATIONAL DEPOSITARY AUTHORITY 



NCIMB Ltd 

■3 St Machar Dr- e 
Merdeen S - 

Address: ^ ABT 



Signature! s) of person! s) having the power 
to repiesent the International Depositary 
Authority ot*t «u«IC?laejJ^ftlciai(s) 1 

Date: 



Where Rule 4(d) applies* such date is the date on which the status of international depositary 
authority was acquired. 
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Danisco Biotechnology, 
-angebrogade 1, 
?0 Box 17, 

1001 Copenhagen K, 
Denmark. 



BUDAPEST TREATY ON THE INTERNATIONAL 
RECOGNITION OF THE DEPOSIT Or MICROORGANISMS 
FOR THE PURPOSES OF PATETT PROCEDURE 

INTERNATIONAL FORM 



VIABILITY STATEMENT 
issu ed pursuant to Rult 10.2 by the 
INTERNATIONAL DEPOSITARY AUTHORITY 
identified on the following page 



NAME AND ADDRESS OF THE PARTY 
TO WHOM TOE VIABILITY STATEMENT 
IS ISSUED 



J 



I. DEPOSITOR 


II. IDENTIFICATION OF THE MICROORGANISM 


AS ABOVE 

Address: 


Accession number given by the 
INTERNATIONAL DEPOSITARY AUTHORITY: 

NCIMB 40774 
Date off the deposit or of the transfer: 

9 November 1995 


ZXX. VIABILITY STATEMENT 


The viability of the microorganism identified under II above vas tested 

on 10 November 1995 2 . on that date, the said microorganism was 
l x 1 viable 


3 

1 1 no longer viable 





Indicate the date of the original deposit or, where a new deoosit or a transfer has been 
made, the most recent relevant date (date of the new depositor date of the transfer). 

in the cases referred to in Rule 10.2(a) (li) and (ill), refer to the most recent viability 
test. 

Mark with a cross the applicable box. 



Fora 3?/9 (first page) 
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j IV. COOTITXOHS UNDER WHICH THE VXABX&X7? TEST BXS BESH PE«OW«D 

■ 



i 



V. XSTtRHATXOHAL DEPOSITARY AUTHORITY 


"""" HC1MB Lto 

^t!?^ AB2 1«* 


Signature is) of person (i) having the power 
to represent the International Depositary 
Authority or of augtorlta d^of f IcialU) t 

D * t V^13 November 1995 



4 fill la If the information has been requested and If the results of the test were negative. 



rora BP/9 (second and last page) 



1/4/08, EAST Version: 2.1.0.14 



WO 97/20937 



PCT/EP96/05581 



53 

CLAIMS 

1. An in vivo modification process that affects the mannose-to-galactose ratio of 
either an organism (or part thereof) capable of producing a mannose/galactose 

5 containing compound or of a mannose/galactose containing compound thereof. 

2. An in vivo modification process that affects the mannose-to-galactose ratio of 
either an organism (or part thereof) capable of producing a mannose/galactose 
containing compound or of a mannose/galactose containing compound thereof, the in 

10 vivo modification process comprising expressing a nucleotide sequence coding for a 
gene product, which gene product has an effect on: 

(a) the mannose-to-galactose ratio of mannose and galactose 
components of a mannose/galactose containing compound; and/or 

15 

(b) the mannose-to-galactose ratio of mannose and galactose 
precursors for a mannose/galactose containing compound; 

and wherein the nucleotide sequence is not a natural nucleotide sequence to the 
20 organism (or pan thereof). 

3. An in vivo modification process that affects the mannose-to-galactose ratio of 
either an organism (or part thereof) capable of producing a mannose/galactose 
containing compound or of a mannose/galactose containing compound thereof, the in 

25 vivo modification process comprising allowing a gene product that is capable of 
having an effect on: 

(a) the mannose-to-galactose ratio of mannose and galactose 
components of a mannose/galactose containing compound; and/ or 

30 

(b) the mannose-to-galactose ratio of mannose and galactose 
precursors for a mannose/galactose containing compound; 
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30 



to have an effect on: 54 

(a) the mannose-to-galactose ratio of mannose and galactose 
components of a mannose/galactose containing compound; and/or 

(b) the mannose-to-galactose ratio of mannose and galactose 
precursors for a mannose/galactose containing compound; 

and wherein the gene product has not been expressed by a nucleotide sequence that 
is a natural nucleotide sequence to the organism (or part thereof). 



4. Use of a nucleotide sequence to affect in vivo the mannose-to-galactose ratio 
of either an organism (or part thereof) capable of producing a mannose/galactose 
containing compound or of a mannose/galactose containing compound thereof, 
15 wherein the nucleotide sequence encodes a gene product which has an effect 



on: 



(a) the mannose-to-galactose ratio of mannose and galactose 
components of a mannose/galactose containing compound; and/or 

20 W the mannose-to-galactose ratio of mannose and galactose 

precursors for a mannose/galactose containing compound; 

and wherein the nucleotide sequence is not a natural nucleotide sequence to the 
organism (or pan thereof). 



5 . Use of a gene product to affect in vivo the mannose-to-galactose ratio of either 
an organism (or part thereof) capable of producing a mannose/galactose containing 
compound or a mannose/galactose containing compound thereof, wherein the gene 
product has an effect on: 

(a) the mannose-to-galactose ratio of mannose and galactose 
components of a mannose/galactose containing compound; and/or 
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(b) the mannose-to-galactose ratio of mannose and galactose 
precursors for a mannose/galactose containing compound; 

and wherein the gene product is not expressed by a nucleotide sequence that is a 
5 natural nucleotide sequence to the organism (or part thereof). 

6. The invention according to any one of claims 1 to 5 wherein the 
mannose/galactose containing compound is galactomannan. 

10 7. The invention according to any one of claims 1 to 6 wherein the organism 
capable of producing a mannose/galactose containing compound is a guar plant. 

8. The invention according to any one of claims 1 to 6 wherein the in vivo 
mannose-to-galactose ratio of the organism (or part thereof) or mannose/galactose 
15 containing compound thereof is higher than that of the guar plant or the 
galactomannan thereof. 



9. The invention according to any one of claims 1 to 8 wherein the in vivo 
mannose-to-galactose ratio of the organism (or part thereof) or mannose/galactose 

20 containing compound thereof is substantially similar to that of the locust bean or the 
galactomannan thereof. 

10. The invention according to any one of claims 2 to 9 wherein the gene product 
is at least one gene product useftil in the biosynthesis of GDP-mannose. 

25 

11. The invention according to claim 10 wherein the gene product is the protein 
shown in Figure 1 or is a variant, homologue or derivative thereof. 

12. The invention according to claim 10 or claim 11 wherein the gene product is 
30 coded by the nucleotide sequence shown in Figure 1 or is a variant, homologue or 

derivative thereof and/or is obtainable from NCIMB 40774. 
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13. The invention according to any one of claims 2 to 9 wherein the gene product 
is an a-galactosidase enzyme. 

14. The invention according to claim 13 wherein the gene product is the protein 
5 shown in Figure 4 or is a variant, homologue or derivative thereof. 

15. The invention according to claim 13 or claim 14 wherein the gene product is 
coded by the nucleotide sequence shown in Figure 4 or is a variant, homologue or 
derivative thereof and/or is obtainable from NCIMB 40831. 

10 

16. An enzyme comprising the amino acid sequence shown in Figure 1, or a 
variant, homologue or fragment thereof. 

17. A nucleotide sequence coding for the enzyme of claim 16 or a sequence that is 
15 complementary thereto and/or is obtainable from NCIMB 40774. 

18. A nucleotide sequence according to claim 17 wherein the nucleotide sequence is 
a DNA sequence. 

20 19. A nucleotide sequence comprising the sequence shown in Figure 1, or a variant, 
homologue or fragment thereof or a sequence that is complementary thereto and/or 
is obtainable from NCIMB 40774. 

20. A construct comprising or expressing the invention according to any one of 
25 claims 16 to 19. 

21. A vector comprising or expressing the invention of any one of claims 16 to 
20. 

30 22. A plasmid comprising or expressing the invention of any one of claims 16 to 
21. 
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23. A transgenic organism (or part thereof) comprising or expressing the invention 
according to any one of claims 16 to 22. 

24. A transgenic organism (or pan thereof) according to claim 23 wherein the 
5 organism is a guar plant. 

25. The invention according to claim 10 wherein the gene product is expressed by 
or is the invention according to any one of claims 16 to 24. 

0 26. An enzyme comprising the amino acid sequence shown in Figure 4, or a 
variant, homologue or fragment thereof. 



27. A nucleotide sequence coding for the enzyme of claim 26 or a sequence that is 
complementary thereto and/or is obtainable from NCIMB 40831. 

15 

28. A nucleotide sequence according to claim 27 wherein the nucleotide sequence is 
a DNA sequence. 

29. A nucleotide sequence comprising the sequence shown in Figure 4, or a variant, 
20 homologue or fragment thereof or a sequence that is complementary thereto and/or 

is obtainable from NCIMB 40831. 

30. A construct comprising or expressing the invention according to any one of 
claims 26 to 29. 

25 

31. A vector comprising or expressing the invention of any one of claims 26 to 
30. 



32. A plasmid comprising or expressing the invention of any one of claims 26 to 
30 31. 
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33 . A transgenic organism (or pan thereof) comprising or expressing the invention 
according to any one of claims 26 to 32. 

34. A transgenic organism (or pan thereof) according to claim 33 wherein the 
5 organism is a guar plant. 

35. The invention according to claim 9 wherein the gene product is expressed by 
or is the invention according to any one of claims 26 to 34. 

10 36, A mannose/galactose containing compound when prepared by the process of 
any one of claims 1 to 3 or any claim dependent thereon. 

37. A foodstuff comprising a mannose/galactose containing compound according 
to claim 36. 

15 

38. A composition comprising a mannose/galactose containing compound 
according to claim 36 admixed with another polysaccharide. 

39. A composition comprising a mannose/galactose containing compound 
20 according to claim 36 admixed with any one or more of xanthan, carrageenan and 

agarose. 

40. A method for preparing a composition or a foodstuff comprising mixing the 
mannose/galactose containing compound according to claim 36 with another suitable 

25 ingredient. 

41. A process substantially as described herein. 

42. A nucleotide sequence substantially as described herein and with reference to 
30 Figure 1. 

43. An enzyme substantially as described herein and with reference to Figure 1. 
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44. A nucleotide sequence substantially as described herein and with reference to 
Figure 4. 

45. An enzyme substantially as described herein and with reference to Figure 4. 

5 
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Done on DNA sequence PMI60. 
DS SGP60.R 

DE MODIFIED BY INVERT COMPLEMENTATION FROM 60R. 
DE MODIFIED 3Y INVERT COMPLEMENTATION FROM R06 . 
DE MODIFIED BY INVERT COMPLEMENTATION FROM 50RNY . 
DE MODIFIED BY INVERT COMPLEMENTATION FROM YNRGS . 

Total number of bases is: 1SSS. 
Analysis done on the complete sequence. 
Done on (absolute) phase(s): l. 
Using the Universal genetic code. 



10 20 30 40 50 60 

I I I I I | 

CCA.GTGTGGTGGAATTCCCTTCCCAATTCCCATTTCCTTTCCGTCACTAACCTTCCTTTT 

70 80 90 100 110 120 

I I I I I I 

GCATCAATTCACTCGCTCGCTATCACCATAGCCATGGAGGAAAAGGCTCATGCCACCAAG 

MEEKAH ATK 

130 140 150 160 170 180 

I I I I I I 

CAGCACCAGCATCTTCAGAGGCTTCATTGCTCCGTCAAGAACTACGATTGGGGTTTACCT 

QHQHLQRLHCSVKNYDWGL? 

190 200 210 220 230 240 

I I I I I i 

GGTCGAATTTCCAATGTCGCGAGGCTCTACGCGCTCAATTCCGGGTCCCAATTCCATCCG 

GRISNVARLYALNSGSQFKP 

250 260 270 280 290 300 

I I I | | | 

GATGAG CCTT ACG CGG AACTTTGG ATGGGT AC C C ACG ATT CAG AGC CAT C GTTT CTT GTC 

DEPYAELWMGTHDSEPSFLV 

310 320 J30 340 350 360 

I I I I I I 

TCCAATGGTGCTCAAAGGGT CACC CTCAAGGCTTGGATTT C CCAG AACC CTGATGTGCT A 

i 
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SNGAQRVTLFCAWISQMPDVL 

370 380 390 400 410 420 

I I I I I I 

GGT3AAAAGGTTCTTCAGAAATGGCGTTGTGATCTCCCTTTCT7GTTCAAGGTGC7GTCT 

GEKVLQKWGCDLPFL-KVLS 

430 440 450 450 470 430 

I I I II | 

GTGGGGAAAGCCTTATCCATACAGGCTCACCCGGATAAGGAGTTGGCTAGGACTCTGC^T 

VGKALS IQAHPDKELARTL K 

490 500 SLO 520 530 540 

I I I I | | 

AAATTGCAGCCTAATCTCTATAAGGATGGCAATCACAAACCTGAGATGGCTCTCGCAATG 

5CLQPNLY KDG NKKPEMALAM 

550 S60 570 580 590 600 

11(111 
ACAGATTTCGAAGCTCTTTGTGGATTCATCACTCCrAAGGAGCTCAAGGCTGTGCTTCAT 

T D FEALCGF ITP KELKAVLH 

€10 620 630 640 650 660 

I I I I I | 

ACTGTTCCTGAAGTTGTTGAACTGGTTGGCGCTGCAAA^ 

"TV PEVVE LVGAANAKL'l LQT 

570 680 690 700 710 720 

I I I I | | 

AGAGATCAGGATGGTGAAGAAAAGGTAAAACCTGTT7TGCAGACAGTATTCACCCACCTC 

ROQDGEEKVKPVLQTVFTKL 

730 740 7S0 760 770 780 

I I I I I | 

A7GTCAGCT AGT AAAGAGAAAGTAACTGATGCAGTAAACAGATTGAAAAGT CGT CTGCAT 

M S A S KEKVTOAVNRLKSRLK 

790 800 8L0 820 830 840 

I I I I I I 

AAGGAAAGTGAGGTGAGGCAGTTGACAGATAAGGAGCAGCTAGTGCTGCGATTGGAAAAG 

KSSEVRQLTDKEQLVLRLEK 
850 860 870 880 890 900 
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I i ! I I I 

CAA7ACCCAGC7GA7G77GG7G7CA7AGC7GCC77C777C77AACC77G7AAAAC7CAG7 
QYPADVGV IAAFFLNLVKLS 

910 920 930 940 950 960 

1 I I I I I 

CC7GG7GAAGCA77G77CC7AGGGGCAAACGAACCACACGCA7A7C7ACA7GGGGAG7GC 

PGSALFLGANEPKAYLKGEC 

970 980 990 1000 ■ 1010 1020 

I I I I I | 

ATTGAA7GCA7C«»AC77CAGACAA7G7TGTGCGAGCTGGCC7ATC7CCCAAACACAGA 

I ' S C M A T S DNVVRAGLSP K H R 

1030 1040 1050 1060 1070 1080 

I I I ! I I 

GA7G7CCAGACCC7T7G77CCA7GC7TACA7ACAAACAGGG77C7CCAGAGA7A77GCCG 

DVQ7LCSML7YKQGS ?£ I L P 

1090 1100 1110 1120 1130 1140 

I I I I I I 

GGAG77CCTC7AAATCCA7A7GTAAAAAAA7ACACCCCGCCArrCAAGGAA7TTGAGA7T 

G VPLNP YVKKY7PPFKSFEI 

1150 1160 1170 1180 1190 1200 

I I I I II 

GA7AGA7G7A77C77CC7CAAGGGAAAACAG7GG7G77CCCAGCAG7GCCAGG7CC77C7 

DRCILPQGK7VVF PAVPGPS 

1210 1220 1230 1240 1250 1260 

I I I I I I 

A7C77777GG7CACAG77GGGGAAGGAG7GA7GAA7ACAGGA7CACCA7CCAAAGAACAA 

IFLVTVGEGVMK-TGSPS'KSQ 

1270 1280 1290 1300 1310 1320 

I I I I I I 

G77G7C7C7GAAGG7GA7G7CC77777GCAGC7GC77ACAC7GAGA77AG7G77AC7AG7 

V VSS GDV LF A A A Y 7 E I S V7S 

1330 1340 1350 1360 1370 1380 

I I I I | I 

GCA7C7GAG77GCA7C7G7ACAG AACGGGAG77AA7AG CAGA777777 CAGG C77C C7 AA 

ASELHLY R7GVNSR F FQ AS - 
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1390 1400 1410 1420 1430 1440 

I I I I I I 

GTTTCAACTGAACGTTCCTTTTTTAGGGGCCA7GGCTTTTGAGAGTCAAACTGTGCAATT 



1450 1460 1470 1480 1490 1500 

1 I I I I I 

GTGTGTTATATATGTAATAGTATTAGTAGCAGTT7AGTGTACATGGAGTAATAACTTAGT 



1510 1520 1530 1540 1550 

I I I I 1 

CTCAGAGAAAAGTTCATCCAAAGAGAGGTCGGTTGCCAAAAAAAAAAAAAAAAAAA 
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J ,ifT*::i**MM tr.r pcO.NA j| 
•it-.: c:uc»40lid«: -1 

ftVS is STf ^(i'jMC* 0< pcCNA i| 

T? prehear 3<u«s 29«.;oi> 
Lac Z <;en* 5as«* 2332-jOO 
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60 



* TRANSLATION OF A NUCLEIC ACID SEQUENCE * 

Done on DNA sequence SECDNA5 . 
Total number of bases is: 1630. 

10 20 30 40 50 

I I I I | , 

GGGGGGGGGGATAACACTCACACGTATACACTACACACACACTCATATATATATAGAGAG 

70 80 90 100 110 120 

I I I I | , 

AGAGAGAGAGAGAGAGATTTATATAGAAAGAAATGGAGAAAATGATGATGTGGGCAAAGG 

MEKMMMWAK 

130 140 150 160 170 180 

I I I I I I 

TTGTGTTGTGCTTGTTTTGGGTCTTGAATGCTTCTAATTGTTCAGGTCGCTTGTTGAACA 
VVLCLFWVLNASNCSGRLLN 

190 200 210 220 230 240 

I I I I I I 

CAATTGGCAATGATCACAACAACATCCATGGAAGACTACTTCTTGGAAATGGACTTGGAA 
TIGNDHNNIHGRLLLGNGLG 

250 260 270 280 290 300 

' I I I I | 

ACACTCCTCCCATGGGATGGAATAGCTGGAACCACTTTCAGTGTGACATTAATGAGGAGA 
NTPPM GWNSWNHFQCDINEE 

310 320 330 340 350 360 

I I I I I I 

TGGTTCGAGAAACAGCTGATGCAATGGTGTCAACGGGTCTTGCATCTTTGGGGTACGAAT 
MVRETADAMVSTGLAS LGYE 

370 380 390 400 410 420 

I I I I I I 

ACGTCAATTTAGATGATTGCTGGGCTGAACTTAACCGAGACTCTAAGGGAAATATGGTTC 
YVNLDDCWAELNRDSKGNMV 

430 440 450 460 470 480 

' I I I I | 

CTAGTGCTTCAAAATTTCCTTCAGGAATTAAGGCTCTGGCTGATTATGTTCATAGCAAAG 
PSASKFPSGIKALADYVHSK 

490 500 510 520 530 540 

' I I I I I ' 

GATTGAAGTTTGGGGTTTATTCTGATGCTGGAAACCAAACATGCAGTAAAGCTATGCCTG 
GLKFGVYSDAGNQTCSKAMP 

550 560 570 580 590 600 

' ' I I I I I 

GATCACTTGGACATGAAGACCAAGGTGCAAAAACATTTGCTTCCTGGGGGGTTGATTTCT 
GSLGHEDQGAKTFASW. GVDF 
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610 620 630 640 650 660 

I " I I I I 

TGAAGTATGACAATTGTAACAACAATGATATAAGCCCAAGAAATAGGTATCCAAAAATGT 
LKYDNCNNND I SPRNRYPKM 

670 680 690 700 710 720 

' ' I I l | 

CTGAAGCTCTAGCAAATTC7GGAAGGGCAATCTTCTTCTCTATGTGTGAATGGGGATCAG 
SEALANSGRA I FFSMCEWGS 

730 740 750 760 770 780 

'''III 
AAGACCCTGCACTATGGGCCAAAAGTGTGGGAAATAGTTGGAGAACAACTGGAGATATTG 
EDPALWAKSVGNSWRTTGD I 

790 800 810 820 830 840 

i ' I I I I 

AAGATAAGTGGGAAAGTATGGCATCTATTGCTGACCAAAATGACAAATGGGCATCTTATG 
EDKWESMAS IADQN DKWASY 

850 860 870 880 890 900 

I I I I I ! 

CTGGGCCTGGAGGGTGGAATGATCCTGATATGCTTGAAGTTGGAAATGGAGGCATGACAA 
AGPGGWNDPDMLEVGNGGMT 

910 920 930 940 950 960 

' I I I I I 

CAGAAGAATATCGTTCTCATTTTAGCATATGGGCATTAGCTAAGGCTCCTTTGTTGA7TG 
TEEYRSHFS IWALAKAPLLI 

970 980 990 1000 1010 1020 

I » I I I I 

GTTGTGATGTTCGATCAATGGATGGCGCAACATACGGACTGCTAAGCAACAAGGAAGTTA 
GCDVRSMDGATYGLLSNKEV 

1030 1040 1050 1060 1070 1080 

I I I I | | 

TTGCAGTAAACCAAGACAGTCTAGGAGTCCAAGGAAAGAAGGTGAAAAGTGATGCTGGTT 
IAVNQDSLGVQGKKVKS DAG 

1090 1100 1110 1120 1130 1140 

I I I I I I 

TGGAGGTTTGGGCAGGTCCTCTGAGTGATAACAGAGTGGCAGTGGTGTTGTGGAATAGAA 
LEVWAGPLS DNRVAVVLWNR 

1150 1160 1170 1180 1190 1200 

I I I I I I 

GTTCATCAAAAGCTACTGTGACTGCATCCTGGTCTGACATAGGGCTTGAAAAAGGAAAGG ' 
SSSKATVTASWSDIGLEKGK 

1210 1220 1230 1240 1250 1260 

I I I I I I 

TGGTCACTGCAAAAGATTTATGGGAGCACACTACAAAAGCATCAGTTTCAGGACAAATTT 
VVTAKDLWEHTTKASVSGQI 
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1270 1280 1290 1300 1310 1320 

I I I I I I 

CTGCAGATATAGATTCACATGCTTGTAAGATGTATGTTCTGACTCCCAATTAAGGCAGAC 
SADI DSHACKMYVLTPN- 

1330 1340 1350 1360 1370 1380 

I I I. I I I 

AGGAAGGTGATGAAAGCCAAGGTTTTAAGGAAGAGGAAATACAATCCCAAGGGTTCCAAA 

1390 1400 1410 1420 1430 1440 

I I I I I I 

AGAGGGTGGGAGAAAATAAAACATGGGAGTTTATTTTTTCAATAAGGAATATATATGGAA 

1450 1460 1470 1480 1490 1500 

I I I I I | 

ATAATTTAAATAAAGATAAGAAATTCGCCCTGAGATTTCTATACATTATTTGTATTATTG 

1510 1520 1530 1540 1550 1560 

I I I I I I 

GAGCATTGAGATTTGAGACCTTTTGATTTCAATTCAATAATTATACCGCAAATGTTTCTC 

1570 1580 1590 1600 1610 1620 

I I I I I I 

TATATTTTCCATAATTATGGTGATTATAATAAATGTTAAATTTGCTTGATGAAAAAAAAA 

1630 
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